Some statistical and logical considerations when rescoring tests
نویسندگان
چکیده
When tests or portions of tests are scored subjectively by raters, a rescoring will yield a change in the ratings of some examinees. In a test with a xed passing score a rescoring will result in the change of some pass/fail decisions. The number of changes depends on three things: the reliability of the rating system, the proportion of examinees that initially pass, and the policy used to incorporate the rescore into the pass/fail decision. In this study, we provide a model that facilitates the evaluation of various rescoring strategies. We consider and compare the e ciency of three rescoring strategies: (1) rescore everyone, (2) rescore failures only, and (3) rescore within some range of the passing cuto . These rescoring strategies are evaluated by direct simulation. Additionally we consider the optimal allocation of rescores where the probability someone asks Eric T. Bradlow ([email protected], (215) 898-8255 (Ph), (215) 898-2534 (Fax)) is Assistant Professor of Marketing and Statistics, Wharton School of Business, 3620 Locust Walk, University of Pennsylvania, Philadelphia, PA. Howard Wainer ([email protected]) is Principal Research Scientist, Statistics and Psychometric Research Group, Educational Testing Service, Rosedale Road MS 15-T, Princeton, NJ. The authors thank Jinming Zhang for his work on earlier versions of this research. This work grew out of a discussion of new problems facing licensing exams at the monthly COPA Research Seminar. We are grateful to the participants of that seminar for their helpful discussions. This research was supported by ETS's Research allocation to the Research Statistic Group.
منابع مشابه
HERNÁNDEZ-VELA: CONTEXTUAL RESCORING FOR HUMAN POSE ESTIMATION 1 Contextual rescoring for Human Pose Estimation
A contextual rescoring method is proposed for improving the detection of body joints of a pictorial structure model for human pose estimation. A set of mid-level parts is incorporated in the model, and their detections are used to extract spatial and score-related features relative to other body joint hypotheses. A technique is proposed for the automatic discovery of a compact subset of poselet...
متن کاملIntegration of Speech to Computer-Assisted Translation Using Finite-State Automata
State-of-the-art computer-assisted translation engines are based on a statistical prediction engine, which interactively provides completions to what a human translator types. The integration of human speech into a computer-assisted system is also a challenging area and is the aim of this paper. So far, only a few methods for integrating statistical machine translation (MT) models with automati...
متن کاملLattice rescoring methods for statistical machine translation
Modern statistical machine translation (SMT) systems include multiple interrelated components, statistical models, and processes. Translation is often factored as a cascaded series of modules such that the output of one module serves as the input to the next; this is the SMT pipeline. Simplifying assumptions, limited training data, and pruning during search mean that the hypothesis produced by ...
متن کاملAn integrated model of cellular manufacturing and supplier selection considering product quality
Today’s business environment has forced manufacturers and plants to produce high-quality products at low cost and the shortest possible delivery time. To cope with this challenge, manufacturing organizations need to optimize the manufacturing and other functions that are in logical association with each other. Therefore, manufacturing system design and supplier selection process are linked toge...
متن کاملExample-based Rescoring of Statistical Machine Translation Output
Conventional statistical machine translation (SMT) approaches might not be able to find a good translation due to problems in its statistical models (due to data sparseness during the estimation of the model parameters) as well as search errors during the decoding process. This paper1 presents an example-based rescoring method that validates SMT translation candidates and judges whether the sel...
متن کامل